Class visualization of high-dimensional data with applications

نویسندگان

  • Inderjit S. Dhillon
  • Dharmendra S. Modha
  • W. Scott Spangler
چکیده

Consider the problem of visualizing high-dimensional data that has been categorized into various classes. Our goal in visualizing is to quickly absorb inter-class and intra-class relationships. Towards this end, class-preserving projections of the multidimensional data onto twodimensional planes, which can be displayed on a computer screen, are introduced. These class-preserving projections maintain the high-dimensional class structure, and are closely related to Fisher’s linear discriminants. By displaying sequences of such two-dimensional projections and by moving continuously from one projection to the next, an illusion of smooth motion through a multidimensional display can be created. We call such sequences class tours. Furthermore, we overlay class-similarity graphs on our two-dimensional projections to capture the distance relationships in the original high-dimensional space. We illustrate the above visualization tools on the classical Iris plant data, the ISOLET spoken letter data, and the PENDIGITS on-line handwriting data set. We show how our visual examination of the data can uncover latent class relationships.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Improved Lvq Algorithm with Data-structure Preserving Visualization

Data-structure preserved visualization of high-dimensional data reveals the dataset borders and the spread and overlapping tendency of the class borders in a more informative manner than the usual data-topology preserved mapping produced by SelfOrganizing Maps (SOMs). Hence, an extension of SOM called Probabilistic Regularized SOM (PRSOM) is proposed for the data-structure preservation in the v...

متن کامل

Solving a class of nonlinear two-dimensional Volterra integral equations by using two-dimensional triangular orthogonal functions

In this paper, the two-dimensional triangular orthogonal functions (2D-TFs) are applied for solving a class of nonlinear two-dimensional Volterra integral equations. 2D-TFs method transforms these integral equations into a system of linear algebraic equations. The high accuracy of this method is verified through a numerical example and comparison of the results with the other numerical methods.

متن کامل

How Porous Nanofibers Have Enhanced the Engineering of Advanced Materials: A Review

Nanofibers are one-dimensional nanomaterialswith a superfine diameter and many potential applicationsdue to their desirable characteristics such as small diameter,high surface area, high flexibility, high porosity, and specialmechanical properties. In the recent years, porous nanofibershave been the subject of considerable research works in awide range of app...

متن کامل

A new approach for data visualization problem

Data visualization is the process of transforming data, information, and knowledge into visual form, making use of humans’ natural visual capabilities which reveals relationships in data sets that are not evident from the raw data, by using mathematical techniques to reduce the number of dimensions in the data set while preserving the relevant inherent properties. In this paper, we formulated d...

متن کامل

Doubly supervised embedding based on class labels and intrinsic clusters for high-dimensional data visualization

Visualization of data can assist decision-making processes by presenting the underlying information in a perceptible manner. Many dimension reduction techniques have been proposed to generate faithful visualization snapshots given high-dimensional data. When class labels associated with the data are already provided, supervised dimension reduction methods, which utilize such pre-given label inf...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computational Statistics & Data Analysis

دوره 41  شماره 

صفحات  -

تاریخ انتشار 2002